Isolated novel nucleic acid and protein molecules from soy and methods of using those molecules to generate transgenic plants with enhanced agronomic traits

ABSTRACT

This disclosure provides purified nucleic acids and polypeptides. Also provided are transgenic plants, seeds, and plant cells containing DNA for expression of the proteins that are useful for imparting enhanced agronomic trait(s) to transgenic crop plants, methods of making such plants and methods of making agricultural commodity including seeds and hybrid seeds from such plants.

CROSS REFERENCE TO RELATED APPLICATIONS

This patent application claims priority (as a U.S. National PhaseApplication filed under 35 U.S.C. §371) to International PatentApplication No. PCT/US2009/000655 (filed Feb. 2, 2009), which in turnclaims priority to U.S. Application No. 61/063,633 (filed Feb. 5, 2008).The entire text of each of these priority patent applications isincorporated by reference into this patent.

FIELD OF THE INVENTION

Disclosed herein are inventions in the field of plant genetics anddevelopmental biology. More specifically, this invention provides novelcompositions of soy DNA and peptide molecules. Also disclosed are plantscomprising recombinant DNA providing one or more enhanced traits in atransgenic plant, including cells, seed, and pollen derived from such aplant, as well as methods of making and using such plant.

INCORPORATION-BY-REFERENCE OF MATERIALS FILED ON COMPACT DISC

A sequence listing having the file name“38-21(55306)B-PCT_SeqListing.txt” which has 370,353,645 bytes (measuredin MS-WINDOWS) was created on Jul. 27, 2010 and has 233556 sequences isenclosed herewith in computer readable format and is herein incorporatedby reference.

SUMMARY OF THE INVENTION

Certain embodiments of the disclosed invention provide recombinant DNAconstructs having polynucleotides characterized by reference to SEQ IDNO:1-116778 and the cognate proteins with amino acid sequences havingreference to SEQ ID NO:116779-233556. The recombinant DNA constructs areused in aspects of the various embodiments of the invention to provideenhanced traits when stably integrated into the chromosomes andexpressed in the nuclei of transgenic plants cells. In many aspects ofthese embodiments, the recombinant DNA constructs, when expressed in aplant cell, provide for expression of cognate proteins. In particularaspects of the invention the recombinant DNA constructs for expressingcognate proteins are characterized by cognate amino acid sequence thathave at least 95% identity over at least 95% of the length of areference sequence selected from the group consisting of SEQ ID NOs:116779-233556 when the amino acid sequence is aligned to the referencesequence. In some aspects of the invention, the recombinant DNAconstructs are characterized as being constructed with sense-orientedand/or anti-sense-oriented polynucleotides from the group consisting ofSEQ ID NOs: 1-116778 which, when expressed in a plant cell, provide forthe suppression of cognate proteins having amino acid sequences thathave at least 95% identity over at least 95% of the length of areference sequence selected from the group consisting of SEQ ID NOs:116779-233556.

In certain aspects of this invention the recombinant DNA constructs ofthe invention are stably integrated into the chromosome of a plant cellnucleus.

Certain aspects of this embodiment of the invention provide transgenicplant cells having stably integrated recombinant DNA constructs,transgenic plants and seeds comprising a plurality of such transgenicplant cells and transgenic pollen of such plants. Such transgenic plantscan be selected from a population of transgenic plants regenerated fromplant cells transformed with recombinant DNA constructs by screeningtransgenic plants for an enhanced trait as compared to control plants.The enhanced trait provided may include, but is not limited to, enhancedwater use efficiency, enhanced cold tolerance, increased yield, enhancednitrogen use efficiency, altered seed protein composition, altered seedoil composition, or any combinations thereof.

Other embodiments of the invention provide for plant cells, plants,seeds, and pollen that can further comprise DNA expressing a proteinthat provides tolerance from exposure to an herbicide applied at levelsthat are lethal to a wild type plant cell.

Embodiments of the invention also provide methods for manufacturingnon-natural, transgenic seed that can be used to produce a crop oftransgenic plants with an enhanced trait resulting from expression of astably-integrated recombinant DNA construct. The methods may compriseone or more of the following steps: (a) screening a population of plantsfor an enhanced trait and a recombinant DNA construct, where individualplants in the population can exhibit the trait at a level less than,essentially the same as or greater than the level that the trait isexhibited in control plants, (b) selecting from the population one ormore plants that exhibit the trait at a level greater than the levelthat said trait is exhibited in control plants, (c) collecting seed froma selected plant, (d) verifying that the recombinant DNA is stablyintegrated in said selected plants, (e) analyzing tissue of a selectedplant to determine the production or suppression of a protein having thefunction of a protein encoded by nucleotides in at least one sequenceselected from SEQ ID NOs:1-116778. In certain embodiments of theinvention, the plants in the population further include DNA expressing aprotein that provides tolerance to exposure to a herbicide applied atlevels that are lethal to wild type plant cells and the selecting isaffected by treating the population with the herbicide, e.g. aglyphosate, dicamba, or glufosinate compound. In another embodiment ofthe invention, the plants are selected by identifying plants with theenhanced trait. The methods can be used for the manufacturing corn,soybean, cotton, canola, alfalfa, wheat, rice, sugarcane or sugar beetseed. In other embodiments of the present invention, the methods canalso be used for manufacture transgenic plants including, but are notlimited to, millet, barley, peanut, pigeon pea, sorghum, vegetables(including but not limited to Broccoli, Cauliflower, Cabbage, Radish,Chinese cabbage, Melons, Watermelons, Cucumber, Gourds, Pumpkin, Squash,Pepper, Tomato, Eggplant, Onion, Carrot, Garden Bean, Sweet Corn, Pea,Dry Bean, Okra, Spinach, Leek, Lettuce, and Fennel), grape, berries(including blue, black, raspberry, mullberry, boisenberry, etc), cherryand related fruit trees (including but not limited to plum, peach,apricot, kiwi, pomegranate, mango, fig), fruit trees (including but notlimited to orange, lemon, lime, blood orange, grapefruit, and the like),and nut trees (including but not limited to coconut, walnut (English andblack), pecan, almond, hazelnut, brazil nut, hickory nut, acorn, and thelike) and sunflower, other oilseed producing plants or any combinationsthereof.

Other embodiments of the invention provide a methods for producinghybrid corn seed by acquiring hybrid corn seed from a herbicide tolerantcorn plant which also has stably-integrated, recombinant DNA constructhaving a promoter that is (a) functional in plant cells and (b) isoperably linked to DNA that encodes or suppresses a protein having thefunction of a protein encoded by nucleotides in at least one sequenceselected from the group consisting of SEQ ID NOs:1-116778. The methodsof these embodiments may further include producing corn plants from saidhybrid corn seed, wherein a fraction of the plants produced from saidhybrid corn seed is homozygous for said recombinant DNA, a fraction ofthe plants produced from said hybrid corn seed is hemizygous for saidrecombinant DNA, and a fraction of the plants produced from said hybridcorn seed has none of said recombinant DNA; selecting corn plants whichare homozygous and hemizygous for said recombinant DNA by treating withan herbicide; collecting seed from herbicide-treated-surviving cornplants and planting said seed to produce further progeny corn plants;repeating the selecting and collecting steps at least once to produce aninbred corn line; and crossing the inbred corn line with a second cornline to produce hybrid seed.

Other embodiments of the invention provide methods for selecting a plantcomprising plant cells of the invention by using an immunoreactiveantibody to detect the presence or absence of protein expressed orsuppressed by recombinant DNA in seed or plant tissue. Anotherembodiment of the invention provides anti-counterfeit milled seedhaving, as an indication of origin, plant cells of this invention.

Still other embodiments of this invention provide for transgenic plantswith enhanced water use efficiency or enhanced nitrogen use efficiency.For example, this invention provides methods of growing a corn, cotton,soybean, or canola crop without irrigation water by planting seed havingplant cells of the invention which are selected for enhanced water useefficiency. Alternatively embodiments of these methods include applyingreduced irrigation water, e.g. providing up to 300 millimeters of groundwater during the production of a corn crop. This invention also providesmethods of growing a corn, cotton, soybean or canola crop without addednitrogen fertilizer by planting seed having plant cells of the inventionwhich are selected for enhanced nitrogen use efficiency.

Other embodiments of the invention provide mixtures comprising plantscells and an antibody to a protein produced in the cells where theprotein has an amino acid sequence that has at least 95% identity overat least 95% of the length of a reference sequence selected from thegroup consisting of SEQ ID NO: 116779-233556 when the sequence isaligned to the reference sequence.

ILLUSTRATIVE EMBODIMENTS OF THE INVENTION

While the making and using of various embodiments of the presentinvention are discussed in detail below, it should be appreciated thatthe present invention provides many applicable inventive concepts thatcan be embodied in a wide variety of specific contexts. The specificembodiments discussed herein are merely illustrative of specific ways tomake and use the invention and do not delimit the scope of theinvention.

To facilitate the understanding of this invention, a number of terms aredefined below. Terms defined herein have meanings as commonly understoodby a person of ordinary skill in the areas relevant to the presentinvention. Terms such as “a”, “an” and “the” are not intended to referto only a singular entity, but include the general class of which aspecific example may be used for illustration. The terminology herein isused to describe specific embodiments of the invention, but their usagedoes not delimit the invention, except as outlined in the claims.

In the attached sequence listing:

SEQ ID NO:1-116778 are nucleotide sequences of the coding strand of DNAfor “genes” used in the recombinant DNA imparting an enhanced trait inplant cells, e.g. each comprises a coding sequence for a protein;

SEQ ID NO: 116779-233556 are amino acid sequences of the cognate proteinof the “genes” with nucleotide coding sequences provided by SEQ ID NO:1-116778;

As used herein, a “plant cell” means a plant cell that is transformedwith stably-integrated, non-natural, recombinant DNA, e.g. byAgrobacterium-mediated transformation or by bombardment usingmicroparticles coated with recombinant DNA or other means. A plant cellof this invention can be an originally-transformed plant cell thatexists as a microorganism or as a progeny plant cell that is duplicatedby regeneration into differentiated tissue, e.g. into a transgenic plantwith stably-integrated, non-natural recombinant DNA, or seed or pollenderived from a progeny transgenic plant.

As used herein, a “transgenic plant” means a plant whose genome has beenaltered by the stable integration of recombinant DNA. A transgenic plantincludes a plant regenerated from an originally-transformed plant celland progeny transgenic plants from later generations or crosses of atransformed plant.

A “consensus amino acid sequence” means an artificial, amino acidsequence indicating conserved amino acids in the sequence of homologousproteins as determined by statistical analysis of an optimal alignment,e.g. CLUSTALW, of amino acid sequence of homolog proteins. The consensussequences listed in the sequence listing were created by identifying themost frequent amino acid at each position in a set of aligned proteinsequences. When there was 100% identity in an alignment the amino acidis indicated by a capital letter. When the occurrence of an amino acidis at least about 70% in an alignment, the amino acid is indicated by alower case letter. When there is no amino acid occurrence of at leastabout 70%, e.g. due to diversity or gaps, the amino acid is indicated byan “x”. When used to defined embodiments of the invention, a consensusamino acid sequence will be aligned with a query protein amino acidsequence in an optimal alignment, e.g. CLUSTALW. An embodiment of theinvention will have identity to the conserved amino acids indicated inthe consensus amino acid sequence.

As used herein, “control plant” means a plant that does not contain therecombinant DNA that expressed a protein which imparts an enhancedtrait. A control plant is to identify and select a transgenic plant thathas an enhance trait. A suitable control plant can be a non-transgenicplant of the parental line used to generate a transgenic plant, e.g.,devoid of recombinant DNA. A suitable control plant can in some cases bea progeny of a hemizygous transgenic plant line that is does not containthe recombinant DNA, known as a negative segregant.

As used herein, an “enhanced trait” means a characteristic of atransgenic plant that includes, but is not limited to, an enhanceagronomic trait characterized by enhanced plant morphology, physiology,growth and development, yield, nutritional enhancement, disease or pestresistance, or environmental or chemical tolerance. In more specificaspects of this invention enhanced trait is selected from group ofenhanced traits consisting of enhanced water use efficiency, enhancedcold tolerance, increased yield, enhanced nitrogen use efficiency,enhanced seed protein and enhanced seed oil. In an important aspect ofthe invention the enhanced trait is enhanced yield including increasedyield under non-stress conditions and increased yield underenvironmental stress conditions. Stress conditions can include, forexample, drought, shade, fungal disease, viral disease, bacterialdisease, insect infestation, nematode infestation, cold temperatureexposure, heat exposure, osmotic stress, reduced nitrogen nutrientavailability, reduced phosphorus nutrient availability and high plantdensity. “Yield” can be affected by many properties including withoutlimitation, plant height, pod number, pod position on the plant, numberof internodes, incidence of pod shatter, grain size, efficiency ofnodulation and nitrogen fixation, efficiency of nutrient assimilation,resistance to biotic and abiotic stress, carbon assimilation, plantarchitecture, resistance to lodging, percent seed germination, seedlingvigor, and juvenile traits. Yield can also be affected by efficiency ofgermination (including germination in stressed conditions), growth rate(including growth rate in stressed conditions), ear number, seed numberper ear, seed size, composition of seed (starch, oil, protein) andcharacteristics of seed fill.

Increased yield of a transgenic plant of the present invention can bemeasured in a number of ways, including test weight, seed number perplant, seed weight, seed number per unit area (e.g., seeds, or weight ofseeds, per acre), bushels per acre, tonnes per acre, tons per acre, kiloper hectare. For example, maize yield can be measured as production ofshelled corn kernels per unit of production area in bushels per acre ormetric tons per hectare, often reported on a moisture adjusted basis atabout 15.5 percent moisture. Increased yield can result from improvedutilization of key biochemical compounds such as nitrogen, phosphorousand carbohydrate, or from improved responses to environmental stresses,such as cold, heat, drought, salt, and attack by pests or pathogens.Recombinant DNA used in this invention can also be used to provideplants having improved growth and development, and ultimately increasedyield, as the result of modified expression of plant growth regulatorsor modification of cell cycle or photosynthesis pathways. Also ofinterest is the generation of transgenic plants that demonstrateenhanced yield with respect to a seed component that can correspond toan increase in overall plant yield. Such properties include enhancementsin seed oil, seed molecules such as tocopherol, protein and starch, oroil, particular oil components as can be manifest by alterations in theratios of seed components.

Seed according to the present invention may be planted, grown andharvested to produce a crop or terminal crop. As used herein, a “crop”is a plant or plant product that is grown and harvested, such plant orplant product including but not limited to plants or plant parts such asleaf, root, shoot, fruit, seed, grain, or the like. A “terminal crop” isa crop grown for uses other than for use as planting seed to producesubsequent generations of plants. In some crop plants, such as grainproduced from hybrid corn, the crop is not very suitable for plantingbecause it does not breed true and the crop can then be convenientlyreferred to as “hybrid grain.” In other crop plants, where the crop doesbreed true, such as soybean, whether a crop is planting seed or aterminal crop will depend on the uses and marketing channels of thecrop. If used or marketed for planting, it will be a crop of plantingseed; if used or marketed for other purposes it will be a terminal crop.

As used herein, “exogenous promoter region” refers to a sequence,capable of promoting mRNA transcription, that does not naturally occurin the plant at the same site and/or linked to the nucleic acids. Thepromoter can be from a different plant species or it can be from thesame plant species, but naturally found in a different location in anon-genetically modified plant. Moreover, the promoter region can befound in the same genetic locus as is present a native plant, but linkedto different sequence(s), than are native.

As used herein, “promoter” means regulatory DNA for initializingtranscription. A “plant promoter” is a promoter capable of initiatingtranscription in plant cells whether or not its origin is a plant cell,e.g. is it well known that Agrobacterium promoters are functional inplant cells. Thus, plant promoters include promoter DNA obtained fromplants, plant viruses and bacteria such as Agrobacterium andBradyrhizobium bacteria. Examples of promoters under developmentalcontrol include promoters that preferentially initiate transcription incertain tissues, such as leaves, roots, or seeds. Such promoters arereferred to as “tissue preferred”. Promoters that initiate transcriptiononly in certain tissues are referred to as “tissue specific”. A “celltype” specific promoter primarily drives expression in certain celltypes in one or more organs, for example, vascular cells in roots orleaves. An “inducible” or “repressible” promoter is a promoter which isunder environmental control. Examples of environmental conditions thatcan effect transcription by inducible promoters include anaerobicconditions, or certain chemicals, or the presence of light. Tissuespecific, tissue preferred, cell-type specific, and inducible promotersconstitute the class of “non-constitutive” promoters. A “constitutive”promoter is a promoter which is active under most conditions.

As used herein, “expressed” means produced, e.g. a protein is expressedin a plant cell when its cognate DNA is transcribed to mRNA that istranslated to the protein.

As used herein, an “expression cassette of a DNA construct” is capableof integrating in a plant genome, expressing a functional polypeptideand providing a transgenic plant expressing polypeptides of theinvention.

As used herein, “suppressed” means decreased, e.g. a protein issuppressed in a plant cell when there is a decrease in the amount and/oractivity of the protein in the plant cell. The presence or activity ofthe protein can be decreased by any amount up to and including a totalloss of protein expression and/or activity.

As used herein, a “functional fragment” refers to a portion of apolypeptide provided herein which retains full or partial molecular,physiological or biochemical function of the full length polypeptide. Afunctional fragment often contains the domain(s), such as Pfam domain,identified in the polypeptide provided in the sequence listing.

As used herein, a “homolog” means a protein in a group of proteins thatperform the same biological function, e.g. proteins that belong to thesame Pfam protein family and that provide a common enhanced trait intransgenic plants of this invention. Homologs are expressed byhomologous genes. With reference to homologous genes, homologs includeorthologs, e.g., genes expressed in different species that evolved froma common ancestral genes by speciation and encode proteins retain thesame function, but do not include paralogs, e.g., genes that are relatedby duplication but have evolved to encode proteins with differentfunctions. Homologous genes include naturally occurring alleles andartificially-created variants. Degeneracy of the genetic code providesthe possibility to substitute at least one base of the protein encodingsequence of a gene with a different base without causing the amino acidsequence of the polypeptide produced from the gene to be changed. Whenoptimally aligned, homolog proteins have typically at least about 60%identity, in some instances at least about 70%, for example about 80%and even at least about 90% identity over the full length of a proteinidentified as being associated with imparting an enhanced trait whenexpressed in plant cells. In one aspect of the invention homologproteins have an amino acid sequence that has at least 90% identity to aconsensus amino acid sequence of proteins and homologs disclosed herein.

Homologs are identified by comparison of amino acid sequence, e.g.manually or by use of a computer-based tool using known homology-basedsearch algorithms such as those commonly known and referred to as BLAST,FASTA, and Smith-Waterman. A local sequence alignment program, e.g.BLAST, can be used to search a database of sequences to find similarsequences, and the summary Expectation value (E-value) used to measurethe sequence base similarity. Because a protein hit with the bestE-value for a particular organism may not necessarily be an ortholog,e.g., have the same function, or be the only ortholog, a reciprocalquery is used to filter hit sequences with significant E-values forortholog identification. The reciprocal query entails search of thesignificant hits against a database of amino acid sequences from thebase organism that are similar to the sequence of the query protein. Ahit can be identified as an ortholog, when the reciprocal query's besthit is the query protein itself or a protein encoded by a duplicatedgene after speciation. A further aspect of the homologs encoded by DNAuseful in the transgenic plants of the invention are those proteins thatdiffer from a disclosed protein as the result of deletion or insertionof one or more amino acids in a native sequence.

Other functional homolog proteins differ in one or more amino acids fromthose of a trait-improving protein disclosed herein as the result of oneor more of the well-known conservative amino acid substitutions, e.g.,valine is a conservative substitute for alanine and threonine is aconservative substitute for serine. Conservative substitutions for anamino acid within the native sequence can be selected from other membersof a class to which the naturally occurring amino acid belongs.Representative amino acids within these various classes include, but arenot limited to: (1) acidic (negatively charged) amino acids such asaspartic acid and glutamic acid; (2) basic (positively charged) aminoacids such as arginine, histidine, and lysine; (3) neutral polar aminoacids such as glycine, serine, threonine, cysteine, tyrosine,asparagine, and glutamine; and (4) neutral nonpolar (hydrophobic) aminoacids such as alanine, leucine, isoleucine, valine, proline,phenylalanine, tryptophan, and methionine. Conserved substitutes for anamino acid within a native amino acid sequence can be selected fromother members of the group to which the naturally occurring amino acidbelongs. For example, a group of amino acids having aliphatic sidechains is glycine, alanine, valine, leucine, and isoleucine; a group ofamino acids having aliphatic-hydroxyl side chains is serine andthreonine; a group of amino acids having amide-containing side chains isasparagine and glutamine; a group of amino acids having aromatic sidechains is phenylalanine, tyrosine, and tryptophan; a group of aminoacids having basic side chains is lysine, arginine, and histidine; and agroup of amino acids having sulfur-containing side chains is cysteineand methionine. Naturally conservative amino acids substitution groupsare: valine-leucine, valine-isoleucine, phenylalanine-tyrosine,lysine-arginine, alanine-valine, aspartic acid-glutamic acid, andasparagine-glutamine. A further aspect of the invention includesproteins that differ in one or more amino acids from those of adescribed protein sequence as the result of deletion or insertion of oneor more amino acids in a native sequence.

Genes that are homologous to each other can be grouped into families andincluded in multiple sequence alignments. Then a consensus sequence foreach group can be derived. This analysis enables the derivation ofconserved and class-(family) specific residues or motifs that arefunctionally important. These conserved residues and motifs can befurther validated with 3D protein structure if available. The consensussequence can be used to define the full scope of the invention, e.g., toidentify proteins with a homolog relationship. Thus, the presentinvention contemplates that protein homologs include proteins with anamino acid sequence that has at least 90% identity to such a consensusamino acid sequence sequences.

As used herein, “operably linked” refers to the association of two ormore nucleic acid elements in a recombinant DNA construct, e.g. as whena promoter is operably linked with DNA that is transcribed to RNAwhether for expressing or suppressing a protein. Recombinant DNAconstructs can be designed to express a protein which can be anendogenous protein, an exogenous homologue of an endogenous protein oran exogenous protein with no native homologue. Alternatively,recombinant DNA constructs can be designed to suppress the level of anendogenous protein, e.g. by suppression of the native gene. Such genesuppression can be effectively employed through a native RNAinterference (RNAi) mechanism in which recombinant DNA comprises bothsense and anti-sense oriented DNA matched to the gene targeted forsuppression where the recombinant DNA is transcribed into RNA that canform a double-strand to initiate an RNAi mechanism. Gene suppression canalso be effected by recombinant DNA that comprises anti-sense orientedDNA matched to the gene targeted for suppression. Gene suppression canalso be effected by recombinant DNA that comprises DNA that istranscribed to a microRNA matched to the gene targeted for suppression.In the examples illustrating the invention recombinant DNA for effectinggene suppression that imparts is identified by the term “antisense”. Itwill be understood by a person of ordinary skill in the art that any ofthe ways of effecting gene suppression are contemplated and enabled by ashowing of one approach to gene suppression.

As used herein, “percent identity” means the extent to which twooptimally aligned DNA or protein segments are invariant throughout awindow of alignment of components, for example nucleotide sequence oramino acid sequence. An “identity fraction” for aligned segments of atest sequence and a reference sequence is the number of identicalcomponents that are shared by sequences of the two aligned segmentsdivided by the total number of sequence components in the referencesegment over a window of alignment which is the smaller of the full testsequence or the full reference sequence. “Percent identity” (“%identity”) is the identity fraction times 100. Such optimal alignment isunderstood to be deemed as local alignment of DNA sequences. For proteinalignment, a local alignment of protein sequences should allowintroduction of gaps to achieve optimal alignment. Percent identity iscalculated over the aligned length not including the gaps introduced bythe alignment per se.

As used herein, a “plant by-product” includes any product that is madefrom a plant or plant product, for example, by dehulling, crushing,milling, extraction, hydrogenation, and other processes. A plantby-products in accordance with the invention, therefore, will includesuch as, for example, dehulled soybeans, crushed corn, soybean meal, soymilk, paper made from corn stalks, and a wide range of other usefulproducts of processing based on plant vitamins, minerals, lipids,proteins and carbohydrates and their constituents that can becharacterized as being produced from crops or terminal crops inaccordance with the invention.

As used herein, a “plant cell” means a plant cell that is transformedwith stably-integrated, recombinant DNA, e.g. by Agrobacterium-mediatedtransformation or by bombardment using microparticles coated withrecombinant DNA or other means. A plant cell of this invention can be anoriginally-transformed plant cell that exists as a microorganism or as aprogeny plant cell that is regenerated into differentiated tissue, e.g.into a transgenic plant with stably-integrated, non-natural recombinantDNA, or seed or pollen derived from a progeny transgenic plant.

As used herein, the term “polypeptide” or “polypeptide molecule” means achain of amino acids. Polypeptide is also commonly referred as“protein”.

As used herein, “polyadenylated ribonucleotides” refers to the series ofadenosines at the 3′ end of a polyribonucleotide commonly referred to asa “poly-A tail”.

As used herein, “recombinant DNA” means DNA which has been a geneticallyengineered and constructed outside of a cell including DNA containingnaturally occurring DNA or cDNA or synthetic DNA.

As used herein, the term “structural nucleic acid molecule” refers to amolecule having sequence that encodes a protein, functional peptidefragment, or any other molecule that has biological activity (including,but not limited to, mRNA and bioactive RNA molecules, includingantisense RNA).

As used herein, the term “substantially purified nucleic acid” orpolypeptide means nucleic acid or protein separated from substantiallyall other molecules normally associated with it in its native state. Asubstantially purified nucleic acid can be greater than about 60% freefrom the other molecules (exclusive of solvent) present in the naturalmixture. The term “substantially purified” is not intended to encompassmolecules present in their native state.

“Pfam” database is a large collection of multiple sequence alignmentsand hidden Markov models covering many common protein families, e.g.Pfam version 19.0 (December 2005) contains alignments and models for8183 protein families and is based on the Swissprot 47.0 and SP-TrEMBL30.0 protein sequence databases. See S. R. Eddy, “Profile Hidden MarkovModels”, Bioinformatics 14:755-763, 1998. The Pfam database is currentlymaintained and updated by the Pfam Consortium. The alignments representsome evolutionary conserved structure that has implications for theprotein's function. Profile hidden Markov models (profile HMMs) builtfrom the protein family alignments are useful for automaticallyrecognizing that a new protein belongs to an existing protein familyeven if the homology by alignment appears to be low.

Recombinant DNA Constructs

Recombinant DNA constructs are assembled using methods well known topersons of ordinary skill in the art and typically comprise a promoteroperably linked to DNA, the expression of which provides the enhancedagronomic trait. Other construct components can include additionalregulatory elements, such as 5′ leaders and introns for enhancingtranscription, 3′ untranslated regions (such as polyadenylation signalsand sites), DNA for transit or signal peptides.

Numerous promoters that are active in plant cells have been described inthe literature. These include promoters present in plant genomes as wellas promoters from other sources, including nopaline synthase (NOS)promoter and octopine synthase (OCS) promoters carried on tumor-inducingplasmids of Agrobacterium tumefaciens and the CaMV35S promoters from thecauliflower mosaic virus as disclosed in U.S. Pat. Nos. 5,164, 316 and5,322,938. Useful promoters derived from plant genes are found in U.S.Pat. No. 5,641,876 which discloses a rice actin promoter, U.S. Pat. No.7,151,204 which discloses a maize chloroplast aldolase promoter and amaize aldolase (FDA) promoter, and US Patent Application Publication2003/0131377 A1 which discloses a maize nicotianamine synthase promoter.These and numerous other promoters that function in plant cells areknown to those skilled in the art and available for use in recombinantpolynucleotides of the present invention to provide for expression ofdesired genes in transgenic plant cells.

Furthermore, the promoters can be altered to contain multiple “enhancersequences” to assist in elevating gene expression. Such enhancers areknown in the art. By including an enhancer sequence with suchconstructs, the expression of the selected protein can be enhanced.These enhancers often are found 5′ to the start of transcription in apromoter that functions in eukaryotic cells, but can often be insertedupstream (5′) or downstream (3′) to the coding sequence. In someinstances, these 5′ enhancing elements are introns. Particularly usefulas enhancers are the 5′ introns of the rice actin 1 (see U.S. Pat. No.5,641,876) and rice actin 2 genes, the maize alcohol dehydrogenase geneintron, the maize heat shock protein 70 gene intron (U.S. Pat. No.5,593,874) and the maize shrunken 1 gene. See also US Patent ApplicationPublication 2002/0192813A1 which discloses 5′, 3′ and intron elementsuseful in the design of effective plant expression vectors.

In other aspects of the invention, sufficient expression in plant seedtissues is desired to affect improvements in seed composition. Exemplarypromoters for use for seed composition modification include promotersfrom seed genes such as napin as disclosed in U.S. Pat. No. 5,420,034,maize L3 oleosin as disclosed in U.S. Pat. No. 6,433,252), zein Z27 asdisclosed by Russell et al. (1997) Transgenic Res. 6(2):157-166),globulin 1 as disclosed by Belanger et at (1991) Genetics 129:863-872),glutelin 1 as disclosed by Russell (1997) supra), and peroxiredoxinantioxidant (Per1) as disclosed by Stacy et al. (1996) Plant Mol Biol.31(6):1205-1216.

Recombinant DNA constructs in this invention will generally include a 3′element that typically contains a polyadenylation signal and site.Well-known 3′ elements include those from Agrobacterium tumefaciensgenes such as nos 3′, tml 3′, tmr 3′, tms 3, ocs 3′, tr7 3′, for exampledisclosed in U.S. Pat. No. 6,090,627; 3′ elements from plant genes suchas wheat (Triticum aesevitum) heat shock protein 17 (Hsp17 3), a wheatubiquitin gene, a wheat fructose-1,6-biphosphatase gene, a rice glutelingene, a rice lactate dehydrogenase gene and a rice beta-tubulin gene,all of which are disclosed in US Patent Application Publication Number2002/0192813 A1; and the pea (Pisum sativum) ribulose biphosphatecarboxylase gene (rbs 3′), and 3′ elements from the genes within thehost plant.

Constructs and vectors can also include a transit peptide for targetingof a gene to a plant organelle, particularly to a chloroplast,leucoplast or other plastid organelle. For descriptions of the use ofchloroplast transit peptides see U.S. Pat. No. 5,188,642 and U.S. Pat.No. 5,728,925. For description of the transit peptide region of anArabidopsis EPSPS gene useful in the present invention, see Klee, H. J.et al (MGG (1987) 210:437-442).

Recombinant DNA constructs for gene suppression can be designed for anyof a number the well-known methods for suppressing transcription of agene, the accumulation of the mRNA corresponding to that gene orpreventing translation of the transcript into protein.Posttranscriptional gene suppression can be practically effected bytranscription of RNA that forms double-stranded RNA (dsRNA) havinghomology to mRNA produced from a gene targeted for suppression.

Gene suppression can also be achieved by insertion mutations created bytransposable elements can also prevent gene function. For example, inmany dicot plants, transformation with the T-DNA of Agrobacterium can bereadily achieved and large numbers of transformants can be rapidlyobtained. Also, some species have lines with active transposableelements that can efficiently be used for the generation of largenumbers of insertion mutations, while some other species lack suchoptions. Mutant plants produced by Agrobacterium or transposonmutagenesis and having altered expression of a polypeptide of interestcan be identified using the polynucleotides of the present invention.For example, a large population of mutated plants can be screened withpolynucleotides encoding the polypeptide of interest to detect mutatedplants having an insertion in the gene encoding the polypeptide ofinterest.

Transgenic plants comprising or derived from plant cells of thisinvention transformed with recombinant DNA can be further enhanced withstacked traits, e.g. a crop plant having an enhanced trait resultingfrom expression of DNA disclosed herein in combination with herbicideand/or pest resistance traits. For example, genes of the currentinvention can be stacked with other traits of agronomic interest, suchas a trait providing herbicide resistance, or insect resistance, such asusing a gene from Bacillus thuringensis to provide resistance againstlepidopteran, coliopteran, homopteran, hemiopteran, and other insects.Herbicides for which transgenic plant tolerance has been demonstratedand the method of the present invention can be applied include, but arenot limited to, glyphosate, dicamba, glufosinate, sulfonylurea,bromoxynil and norflurazon herbicides. Polynucleotide molecules encodingproteins involved in herbicide tolerance are well-known in the art andinclude, but are not limited to, a polynucleotide molecule encoding5-enolpyruvylshikimate-3-phosphate synthase (EPSPS) disclosed in U.S.Pat. Nos. 5,094,945; 5,627,061; 5,633,435 and 6,040,497 for impartingglyphosate tolerance; polynucleotide molecules encoding a glyphosateoxidoreductase (GOX) disclosed in U.S. Pat. No. 5,463,175 and aglyphosate-N-acetyl transferase (GAT) disclosed in U.S. PatentApplication Publication Number 2003/0083480 A1 also for impartingglyphosate tolerance; dicamba monooxygenase disclosed in U.S. PatentApplication Publication Number 2003/0135879 A1 for imparting dicambatolerance; a polynucleotide molecule encoding bromoxynil nitrilase (Bxn)disclosed in U.S. Pat. No. 4,810,648 for imparting bromoxynil tolerance;a polynucleotide molecule encoding phytoene desaturase (crtl) describedin Misawa et al, (1993) Plant J. 4:833-840 and in Misawa et al, (1994)Plant J. 6:481-489 for norflurazon tolerance; a polynucleotide moleculeencoding acetohydroxyacid synthase (AHAS, aka ALS) described inSathasiivan et al. (1990) Nucl. Acids Res. 18:2188-2193 for impartingtolerance to sulfonylurea herbicides; polynucleotide molecules known asbar genes disclosed in DeBlock, et al. (1987) EMBO J. 6:2513-2519 forimparting glufosinate and bialaphos tolerance; polynucleotide moleculesdisclosed in U.S. Patent Application Publication Number 2003/010609 A1for imparting N-amino methyl phosphonic acid tolerance; polynucleotidemolecules disclosed in U.S. Pat. No. 6,107,549 for imparting pyridineherbicide resistance; molecules and methods for imparting tolerance tomultiple herbicides such as glyphosate, atrazine, ALS inhibitors,isoxoflutole and glufosinate herbicides are disclosed in U.S. Pat. No.6,376,754 and U.S. Patent Application Publication Number 2002/0112260.Molecules and methods for imparting insect/nematode/virus resistance aredisclosed in U.S. Pat. Nos. 5,250,515; 5,880,275; 6,506,599; 5,986,175and U.S. Patent Application Publication Number 2003/0150017 A1.

Embodiments of the invention provide molecules of that include“fragments” of the disclosed recombinant DNA molecules; includingoligonucleotides of at least 15, at least 16 or 17, at least 18 or 19,and at least 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30 or more,consecutive nucleotides of any of the sequences provided. Sucholigonucleotides are fragments of the larger molecules having a sequenceselected from the group consisting of SEQ ID NO: 1 through SEQ ID NO:116778, and find use, for example as probes and primers for detection ofthe polynucleotides of the present invention.

Aspects of the various embodiments of the invention also provide fornucleic acid fragments of SEQ ID NO: 1 through SEQ ID NO: 116778 thatare at least about 50, 100, 150, 200, 400, 500 or more nucleotides inlength. Some aspects of these embodiments provide for nucleic acidmolecules that encode functional fragments of any of the polypeptidesequences provided in SEQ ID NO: 116779 to SEQ ID NO: 233556.

Other embodiments of the present invention provide for one or morepolypeptides having at least about 10 contiguous peptide residues of oneor more of the peptide sequences provided in SEQ ID NO: 116779 to SEQ IDNO: 233556. In other aspects of these embodiment, the polypeptide(s)comprises at least 15, 20, 25, 30, 35, 50, 75, 100 or more contiguouspeptide residues from one or more of the sequences provided in SEQ IDNO: 116779 to SEQ ID NO: 233556. In particularly aspects, theseembodiments the poly peptide includes a functional fragment of one ormore of the polypeptides provided in the Sequence Listing.

Other aspects of the various embodiments of the invention includes,transgenic plants, transgenic plant seeds, transgenic crops, plantproducts and byproducts having any of the nucleic acid or proteinfragments described above. The invention also provides for variousmethods that use such fragments.

Embodiments of the present invention also contemplate that thetrait-improving recombinant DNA provided herein can be used incombination with other recombinant DNA to create plants with multipledesired traits or a further enhanced trait. The combinations generatedcan include multiple copies of any one or more of the recombinant DNAconstructs. These stacked combinations can be created by any method,including but not limited to cross breeding of transgenic plants, ormultiple genetic transformation.

Plant Cell Transformation Methods

Numerous methods for transforming chromosomes in a plant cell nucleuswith recombinant DNA are known in the art and are used in methods ofpreparing a transgenic plant cell nucleus cell, and plant. Two effectivemethods for such transformation are Agrobacterium-mediatedtransformation and microprojectile bombardment. Microprojectilebombardment methods are illustrated in U.S. Pat. Nos. 5,015,580(soybean); 5,550,318 (corn); 5,538,880 (corn); 5,914,451 (soybean);6,160,208 (corn); 6,399,861 (corn); 6,153,812 (wheat) and 6,365,807(rice) and Agrobacterium-mediated transformation is described in U.S.Pat. Nos. 5,159,135 (cotton); 5,824,877 (soybean); 5,463,174 (canola);5,591,616 (corn); 5,846,797 (cotton); 6,384,301 (soybean), 7,026,528(wheat) and 6,329,571 (rice), US Patent Application Publication2004/0087030 A1 (cotton), and US Patent Application Publication2001/0042257 A1 (sugar beet), all of which are incorporated herein byreference for enabling the production of transgenic plants.Transformation of plant material is practiced in tissue culture on anutrient media, e.g., a mixture of nutrients that will allow cells togrow in vitro. Recipient cell targets include, but are not limited to,meristem cells, hypocotyls, calli, immature embryos and gametic cellssuch as microspores, pollen, sperm and egg cells. Callus can beinitiated from tissue sources including, but not limited to, immatureembryos, hypocotyls, seedling apical meristems, microspores and thelike. Cells containing a transgenic nucleus are grown into transgenicplants.

In addition to direct transformation of a plant material with arecombinant DNA, a transgenic plant cell nucleus can be prepared bycrossing a first plant having cells with a transgenic nucleus withrecombinant DNA with a second plant lacking the transgenic nucleus. Forexample, recombinant DNA can be introduced into a nucleus from a firstplant line that is amenable to transformation to transgenic nucleus incells that are grown into a transgenic plant which can be crossed with asecond plant line to introgress the recombinant DNA into the secondplant line. A transgenic plant with recombinant DNA providing anenhanced trait, e.g. enhanced yield, can be crossed with transgenicplant line having other recombinant DNA that confers another trait, forexample herbicide resistance or pest resistance, to produce progenyplants having recombinant DNA that confers both traits. Typically, insuch breeding for combining traits the transgenic plant donating theadditional trait is a male line and the transgenic plant carrying thebase traits is the female line. The progeny of this cross will segregatesuch that some of the plants will carry the DNA for both parental traitsand some will carry DNA for one parental trait; such plants can beidentified by markers associated with parental recombinant DNA, e.g.marker identification by analysis for recombinant DNA or, in the casewhere a selectable marker is linked to the recombinant, by applicationof the selecting agent such as a herbicide for use with a herbicidetolerance marker, or by selection for the enhanced trait. Progeny plantscarrying DNA for both parental traits can be crossed back into thefemale parent line multiple times, for example usually 6 to 8generations, to produce a progeny plant with substantially the samegenotype as one original transgenic parental line but for therecombinant DNA of the other transgenic parental line.

In certain embodiments of the invention the recombinant DNA insertion is“targeted” in order to achieve site-specific integration, for example toreplace an existing gene in the genome, to use an existing promoter inthe plant genome, or to insert a recombinant polynucleotide at apredetermined site known to be active for gene expression. Several sitespecific recombination systems exist which are known to functionimplants include cre-lox as disclosed in U.S. Pat. No. 4,959,317 andFLP-FRT as disclosed in U.S. Pat. No. 5,527,695, both incorporatedherein by reference.

Transformation methods of this invention can be practiced in tissueculture on media and in a controlled environment. “Media” refers to thenumerous nutrient mixtures that are used to grow cells in vitro, thatis, outside of the intact living organism. Recipient cell targetsinclude, but are not limited to, meristem cells, callus, immatureembryos and gametic cells such as microspores, pollen, sperm and eggcells. It is contemplated that any cell from which a fertile plant canbe regenerated is useful as a recipient cell. Callus can be initiatedfrom tissue sources including, but not limited to, immature embryos,seedling apical meristems, microspores and the like. Cells capable ofproliferating as callus are also recipient cells for genetictransformation. Practical transformation methods and materials formaking transgenic plants of this invention, for example, various mediaand recipient target cells, transformation of immature embryo cells andsubsequent regeneration of fertile transgenic plants are disclosed inU.S. Pat. Nos. 6,194,636 and 6,232,526, which are incorporated herein byreference.

The seeds of transgenic plants can be harvested from fertile transgenicplants and be used to grow progeny generations of transformed plants ofthis invention including hybrid plants line for selection of plantshaving an enhanced trait. In addition to direct transformation of aplant with a recombinant DNA, transgenic plants can be prepared bycrossing a first plant having a recombinant DNA with a second plantlacking the DNA. For example, recombinant DNA can be introduced intofirst plant line that is amenable to transformation to produce atransgenic plant which can be crossed with a second plant line tointrogress the recombinant DNA into the second plant line. A transgenicplant with recombinant DNA providing an enhanced trait, e.g. enhancedyield, can be crossed with transgenic plant line having otherrecombinant DNA that confers another trait, for example herbicideresistance or pest resistance, to produce progeny plants havingrecombinant DNA that confers both traits. Typically, in such breedingfor combining traits the transgenic plant donating the additional traitis a male line and the transgenic plant carrying the base traits is thefemale line. The progeny of this cross will segregate such that some ofthe plants will carry the DNA for both parental traits and some willcarry DNA for one parental trait; such plants can be identified bymarkers associated with parental recombinant DNA, e.g. markeridentification by analysis for recombinant DNA or, in the case where aselectable marker is linked to the recombinant, by application of theselecting agent such as a herbicide for use with a herbicide tolerancemarker, or by selection for the enhanced trait. Progeny plants carryingDNA for both parental traits can be crossed back into the female parentline multiple times, for example usually 6 to 8 generations, to producea progeny plant with substantially the same genotype as one originaltransgenic parental line but for the recombinant DNA of the othertransgenic parental line.

Descriptions of commonly used breeding terms such as “crossing”,“hybrids” and methods for crossing and producing hybrid that are used todescribe present invention can be found in one of several referencebooks (Allard, “Principles of Plant Breeding,” John Wiley & Sons, N.Y.,U. of CA, Davis, Calif., 50-98, 1960; Simmonds, “Principles of cropimprovement,” Longman, Inc., N.Y., 369-399, 1979; Sneep and Hendriksen,“Plant breeding perspectives,” Wageningen (ed), Center for AgriculturalPublishing and Documentation, 1979; Fehr, In: Soybeans: Improvement,Production and Uses, 2nd Edition, Monograph., 16:249, 1987; Fehr,“Principles of variety development,” Theory and Technique, (Vol. 1) andCrop Species Soybean (Vol. 2), Iowa State Univ., Macmillan Pub. Co.,N.Y., 360-376, 1987).

In some embodiments of the invention, during transformation, DNA isintroduced into only a small percentage of target plant cells in any onetransformation. Marker genes are used to provide an efficient system foridentification of those cells that are stably transformed by receivingand integrating a transgenic DNA construct into their genomes. Markergenes provide selective markers which confer resistance to a selectiveagent, such as an antibiotic or herbicide. Any of the herbicides towhich plants of this invention can be resistant are useful agents forselective markers. Potentially transformed cells are exposed to theselective agent. In the population of surviving cells will be thosecells where, generally, the resistance-conferring gene is integrated andexpressed at sufficient levels to permit cell survival. Cells can betested further to confirm stable integration of the exogenous DNA.Commonly used selective marker genes include those conferring resistanceto antibiotics such as kanamycin and paromomycin (nptII), hygromycin B(aph IV) and gentamycin (aac3 and aacC4) or resistance to herbicidessuch as glufosinate (bar or pat) and glyphosate (aroA or EPSPS).Examples of such selectable are illustrated in U.S. Pat. Nos. 5,550,318;5,633,435; 5,780,708 and 6,118,047, all of which are incorporated hereinby reference. Selectable markers which provide an ability to visuallyidentify transformants can also be employed, for example, a geneexpressing a colored or fluorescent protein such as a luciferase orgreen fluorescent protein (GFP) or a gene expressing a betaglucuronidase or uidA gene (GUS) for which various chromogenicsubstrates are known.

Plant cells that survive exposure to the selective agent, or plant cellsthat have been scored positive in a screening assay, can be cultured inregeneration media and allowed to mature into plants. Developingplantlets regenerated from transformed plant cells can be transferred toplant growth mix, and hardened off, for example, in an environmentallycontrolled chamber at about 85% relative humidity, 600 ppm CO₂, and25-250 microeinsteins m⁻²s⁻¹ of light, prior to transfer to a greenhouseor growth chamber for maturation. Plants are regenerated from about 6weeks to 10 months after a transformant is identified, depending on theinitial tissue. Plants can be pollinated using conventional plantbreeding methods known to those of skill in the art and seed produced.The regenerated transformed plant or its progeny seed or plants can betested for expression of the recombinant DNA and selected for thepresence of enhanced agronomic trait.

Progeny can be recovered from transformed plants and tested forexpression of the exogenous recombinant polynucleotide. Useful assaysinclude, for example, “molecular biological” assays, such as Southernand Northern blotting and PCR; “biochemical” assays, such as detectingthe presence of RNA, e.g., double stranded RNA, or a protein product,e.g., by immunological means (ELISAs and Western blots) or by enzymaticfunction; plant part assays, such as leaf or root assays; and also, byanalyzing the phenotype of the whole regenerated plant.

Transgenic Plants and Seeds

Transgenic plants derived from the plant cells of this invention aregrown to generate transgenic plants having an enhanced trait as comparedto a control plant and produce transgenic seed and haploid pollen ofthis invention. Such plants with enhanced traits are identified byselection of transformed plants or progeny seed for the enhanced trait.For efficiency a selection method is designed to evaluate multipletransgenic plants (events) including the recombinant DNA, for examplemultiple plants from 2 to 20 or more transgenic events. Transgenicplants grown from transgenic seed provided herein demonstrate improvedagronomic traits that contribute to increased yield or other trait thatprovides increased plant value, including, for example, improved seedquality. Of particular interest are plants having enhanced water useefficiency, enhanced cold tolerance, increased yield, enhanced nitrogenuse efficiency, enhanced seed protein and enhanced seed oil. Transgenicplants of the present invention include, but are not limited to, corn,soybean, cotton, canola, alfalfa, wheat, rice, sugarcane, sugar beetseed, millet, barley, peanut, pigeon pea, sorghum, vegetables (includingbut not limited to Broccoli, Cauliflower, Cabbage, Radish, Chinesecabbage, Melons, Watermelons, Cucumber, Gourds, Pumpkin, Squash, Pepper,Tomato, Eggplant, Onion, Carrot, Garden Bean, Sweet Corn, Pea, Dry Bean,Okra, Spinach, Leek, Lettuce, and Fennel), grape, berries (includingblue, black, raspberry, mullberry, boysenberry . . . etc), cherry andrelated fruit trees (including but not limited to plum, peach, apricot,kiwi, pomegranate, mango, fig), fruit trees (including but not limitedto orange, lemon, lime, blood orange, grapefruit, and the like), nuttrees (including but not limited to coconut, walnut (English and black),pecan, almond, hazelnut, brazil nut, hickory nut, acorn, and the like),sunflower, other oilseed producing plants or any combinations thereof.

Selection Methods for Transgenic Plants with Enhanced Agronomic Trait

Within a population of transgenic plants each regenerated from a plantcell having a nucleus with recombinant DNA many plants that survive tofertile transgenic plants that produce seeds and progeny plants may notexhibit an enhanced agronomic trait. Selection from such population isnecessary to identify one or more transgenic plant cells having atransgenic nucleus that can provide plants with the enhanced trait.Transgenic plants having enhanced traits are selected from populationsof plants regenerated or derived from plant cells transformed asdescribed herein by evaluating the plants in a variety of assays todetect an enhanced trait. These assays also can take many formsincluding, but not limited to, direct screening for the trait in agreenhouse or field trial or by screening for a surrogate trait. Suchanalyses can be directed to detecting changes in the chemicalcomposition, biomass, physiological properties, morphology of the plant.Changes in chemical compositions such as nutritional composition ofgrain can be detected by analysis of the seed composition and content ofprotein, free amino acids, oil, free fatty acids, starch or tocopherols.Changes in biomass characteristics can be made on greenhouse or fieldgrown plants and can include plant height, stem diameter, root and shootdry weights; and, for corn plants, ear length and diameter. Changes inphysiological properties can be identified by evaluating responses tostress conditions, for example assays using imposed stress conditionssuch as water deficit, nitrogen deficiency, cold growing conditions,pathogen or insect attack or light deficiency, or increased plantdensity. Changes in morphology can be measured by visual observation oftendency of a transformed plant with an enhanced agronomic trait to alsoappear to be a normal plant as compared to changes toward bushy, taller,thicker, narrower leaves, striped leaves, knotted trait, chlorosis,albino, anthocyanin production, or altered tassels, ears or roots. Otherselection properties include days to pollen shed, days to silking, leafextension rate, chlorophyll content, leaf temperature, stand, seedlingvigor, internode length, plant height, leaf number, leaf area,tillering, brace roots, stay green, stalk lodging, root lodging, planthealth, barreness/prolificacy, green snap, and pest resistance. Inaddition, phenotypic characteristics of harvested grain can beevaluated, including number of kernels per row on the ear, number ofrows of kernels on the ear, kernel abortion, kernel weight, kernel size,kernel density and physical grain quality.

Assays for screening for a desired trait are readily designed by thosepracticing in the art. The following illustrates screening assays forcorn traits using hybrid corn plants. The assays can be readily adaptedfor screening other plants such as canola, cotton and soybean either ashybrids or inbreds.

In certain embodiments of the invention transgenic corn plants havingnitrogen use efficiency can be identified by screening in fields withthree levels of nitrogen (N) fertilizer being applied, e.g. low level (0N), medium level (80 lb/ac) and high level (180 lb/ac). Plants withenhanced nitrogen use efficiency provide higher yield as compared tocontrol plants.

In other embodiments, transgenic corn plants having enhanced yield canbe identified by screening using progeny of the transgenic plants overmultiple locations with plants grown under optimal production managementpractices and maximum weed and pest control. A useful target forimproved yield is a 5% to 10% increase in yield as compared to yieldproduced by plants grown from seed for a control plant. Selectionmethods can be applied in multiple and diverse geographic locations, forexample up to 16 or more locations, over one or more planting seasons,for example at least two planting seasons, to statistically distinguishyield improvement from natural environmental effects.

In other embodiments, transgenic corn plants having enhanced water useefficiency can be identified by screening plants in an assay where wateris withheld for a period to induce stress followed by watering to revivethe plants. For example, a useful selection process imposes 3drought/re-water cycles on plants over a total period of 15 days afteran initial stress free growth period of 11 days. Each cycle consists of5 days, with no water being applied for the first four days and a waterquenching on the 5th day of the cycle. The primary phenotypes analyzedby the selection method are the changes in plant growth rate asdetermined by height and biomass during a vegetative drought treatment.

In other embodiments, transgenic corn plants having enhanced coldtolerance can be identified by screening plants in a cold germinationassay and/or a cold tolerance field trial. In a cold germination assaytrays of transgenic and control seeds are placed in a growth chamber at9.7° C. for 24 days (no light). For example, seeds having highergermination rates as compared to the control can be identified as havingenhanced cold tolerance. In a cold tolerance field trial plants withenhanced cold tolerance can be identified from field planting at anearlier date than conventional Spring planting for the field location.For example, seeds are planted into the ground around two weeks beforelocal farmers begin to plant corn so that a significant cold stress isexerted onto the crop, named as cold treatment. Seeds can be plantedunder local optimal planting conditions such that the crop has little orno exposure to cold condition, named as normal treatment. At eachlocation, seeds may be can be planted under both cold and normalconditions preferably with multiple repetitions per treatment.

In other embodiments, transgenic corn plants having seeds with increasedprotein and/or oil levels can be identified by analyzing progeny seedfor protein and/or oil. Near-infrared transmittance spectrometry is anon-destructive, high-throughput method that is useful to determine thecomposition of a bulk seed sample for properties listed in table 1.

TABLE 1 Typical sample(s): Whole grain corn and soybean seeds Typicalanalytical range: Corn - moisture 5-15%, oil 5-20%, protein 5-30%,starch 50-75%, and density 1.0-1.3%. Soybean - moisture 5-15%, oil15-25%, and protein 35-50%.

Although the plant cells and methods of this invention can be applied toany plant cell, plant, seed or pollen, e.g. any fruit, vegetable, grass,tree or ornamental plant, the various aspects of the invention arepreferably applied to corn, soybean, cotton, canola, alfalfa, wheat,rice, sugarcane, and sugar beet plants. In many cases the invention isapplied to corn plants that are inherently resistant to disease from theMal de Rio Cuarto virus or the Puccina sorghi fungus or both.

Homolog Identification

In certain embodiment, the present invention also includesidentification of homologs of proteins encoded by the DNA identified inthe sequence listing which is used to provide transgenic seed and plantshaving enhanced agronomic traits. From the sequence of the homologs,homologous DNA sequence are identified for preparing additionaltransgenic seeds and plants of this invention with enhanced agronomictraits.

An “All Protein Database” are constructed of known protein sequencesusing a proprietary sequence database and the National Center forBiotechnology Information (NCBI) non-redundant amino acid database(nr.aa). For each organism from which a polynucleotide sequence providedherein can be obtained, an “Organism Protein Database” are constructedof known protein sequences of the organism; it is a subset of the AllProtein Database based on the

NCBI taxonomy ID for the Organism.

The All Protein Database are queried using amino acid sequences providedherein as SEQ ID NO: 116779 through SEQ ID NO: 233556 using NCBI“blastp” program with E-value cutoff of 1e⁻⁸. Up to 1000 top hits arekept, and separated by organism names. For each organism other than thatof the query sequence, a list is kept for hits from the query organismitself with a more significant E-value than the best hit of theorganism. The list contain likely duplicated genes of thepolynucleotides provided herein, and is referred to as the Core List.Another list is kept for all the hits from each organism, sorted byE-value, and referred to as the Hit List.

The Organism Protein Database are queried using polypeptide sequencesprovided herein as SEQ ID NO: 116779 through SEQ ID NO: 233556 usingNCBI “blastp” program with E-value cutoff of 1 e⁻⁴. Up to 1000 top hitsare kept. A BLAST searchable database is constructed based on thesehits, and are referred to as “SubDB”. SubDB are queried with eachsequence in the Hit List using NCBI “blastp” program with E-value cutoffof 1 e⁻⁸. The hit with the best E-value are compared with the Core Listfrom the corresponding organism. The hit is deemed a likely ortholog ifit belongs to the Core List, otherwise it is deemed not a likelyortholog and there is no further search of sequences in the Hit List forthe same organism. Homologs from a large number of distinct organismsare identified and reported.

Recombinant DNA constructs can be prepared using the DNA encoding eachof the identified homologs and the constructs can be used to preparemultiple events of transgenic corn, soybean, canola, cotton and othertransgenic plants mentioned. Plants can be regenerated from thetransformed plant cells and used to produce progeny plants and seed thatare screened for enhanced water use efficiency, enhanced cold tolerance,increased yield, enhanced nitrogen use efficiency, enhanced seed proteinand enhanced seed oil. From each group of multiple events of transgenicplants with a specific recombinant DNA for a homolog the event thatproduces the greatest enhancement in yield, water use efficiency,nitrogen use efficiency, enhanced cold tolerance, enhanced seed proteinand enhanced seed oil is identified and progeny seed can be selected forcommercial development.

Pfam Module Annotation

The amino acid sequence of the expressed proteins shown to be associatedwith an enhanced trait are analyzed for Pfam protein family against thecurrent Pfam collection of multiple sequence alignments and hiddenMarkov models using the HMMER software in the appended computer listing.The Pfam domain modules and individual protein domain for the proteinsshown in the sequence listing. The Hidden Markov model databases for theidentified patent families are known to a skilled artisan allowingidentification of other homologous proteins and their cognate encodingDNA to enable the full breadth of the invention for a person of ordinaryskill in the art. Certain proteins are identified by a single Pfamdomain and others by multiple Pfam domains.

In one aspect, the present invention includes recombinant DNA constructshaving a polynucleotide encoding a protein that has an amino acidsequence having at least 95% identity over at least 95% of the length ofa reference sequence selected from the group consisting of SEQ ID NO:116779-233556 when said amino acid sequence is aligned with saidreference sequence.

In another aspect, the present invention includes a mixture having plantcells, and an antibody to a protein produced in said cells wherein saidprotein has an amino acid sequence that has at least 95% identity overat least 95% of the length of a reference sequence selected from thegroup consisting of SEQ ID NO: 116779-233556 when said amino acidsequence is aligned to said reference sequence.

Yet another aspect of the present invention includes recombinant DNAconstructs having a promoter that is functional in a plant cell and thatis operably linked to a polynucleotide that: (a) encodes a proteinhaving an amino acid sequence having at least 95% identity over at least95% of the length of a reference sequence selected from the groupconsisting of SEQ ID NO: 116779-233556, when said amino acid sequence isaligned to said reference sequence; or is transcribed into an RNAmolecule that suppresses the level of an endogenous protein that has anamino acid sequence that is at least 95% identical over at least 95% ofthe length of a reference sequence of SEQ ID NO: 116779-233556, whensaid amino acid sequence is aligned to said reference sequence; andwherein said construct is stably integrated into a chromosome in a plantcell nucleus. In some aspect, this invention includes a transgenic plantcell having the recombinant DNA construct wherein said DNA constructprovides for an enhanced trait as compared to control plants; andwherein said enhanced trait is enhanced water use efficiency, enhancedcold tolerance, increased yield, enhanced nitrogen use efficiency,enhanced seed protein or enhanced seed oil. The DNA construct furthercan express a protein that provides tolerance from exposure to anherbicide (e.g., a glyphosate, dicamba, or glufosinate compound) havingan agent applied at levels that are lethal to a wild type of said plantcell nucleus.

In some aspects, this invention provides transgenic plants (e.g., corn,soybean, cotton, canola, alfalfa, wheat, rice, sugarcane, or sugar beetplant), that is homozygous for said recombinant DNA and have a pluralityof plant cells.

Yet in another aspect, this invention includes transgenic pollen grainshaving a haploid derivative of a plant cell nucleus having a chromosomecomprising the recombinant DNA construct.

In one aspect, this inventions provides a method for manufacturingnon-natural, transgenic seed (e.g., corn, soybean, cotton, canola,alfalfa, wheat, rice, sugarcane, or sugar beet seed) that can be used toproduce a crop of transgenic plants with an enhanced trait resultingfrom expression of a stably-integrated, recombinant DNA construct, saidmethod includes (a) screening a population of plants for said enhancedtrait and said recombinant DNA, wherein individual plants in saidpopulation exhibit said trait at a level less than, essentially the sameas or greater than the level that said trait is exhibited in controlplants which do not contain said recombinant DNA, wherein said enhancedtrait is selected from the group of enhanced traits consisting ofenhanced water use efficiency, enhanced cold tolerance, increased yield,enhanced nitrogen use efficiency, enhanced seed protein and enhancedseed oil; (b) selecting from said population one or more plants thatexhibit said trait at a level greater than the level that said trait isexhibited in control plants, and (c) collecting seed from selected plantfrom step b.

Yet in another aspect, the method of this invention further includes:(a) verifying that said recombinant DNA is stably integrated in saidselected plants, and (b) analyzing tissue of said selected plant todetermine the expression or suppression of a protein having the functionof a protein having an amino acid sequence selected from the groupconsisting of one of SEQ ID NOs: 116779-233556.

In one aspect, this invention provides a method of producing hybrid cornseed including: (a) acquiring hybrid corn seed from an herbicidetolerant corn plant which also has a stably-integrated, recombinant DNAconstruct; (b) producing corn plants from said hybrid corn seed, whereina fraction of the plants produced from said hybrid corn seed ishomozygous for said recombinant DNA, a fraction of the plants producedfrom said hybrid corn seed is hemizygous for said recombinant DNA, and afraction of the plants produced from said hybrid corn seed has none ofsaid recombinant DNA; (c) selecting corn plants which are homozygous andhemizygous for said recombinant DNA by treating with an herbicide; (d)collecting seed from herbicide-treated-surviving corn plants andplanting said seed to produce further progeny corn plants; (e) repeatingsteps (c) and (d) at least once to produce an inbred corn line; and (f)crossing said inbred corn line with a second corn line to produce hybridseed.

In some aspect, this invention provides substantially purified nucleicacid molecule having a nucleic acid sequence wherein said nucleic acidsequence exhibits 95% or greater identity to a nucleic acid sequenceselected from the group consisting of SEQ ID NO: 1 through SEQ ID NO:116778 and sequences complementary to SEQ ID NO: 1 through SEQ ID NO:116778.

It is contemplated that any embodiment discussed in this specificationcan be implemented with respect to any method, kit, reagent, orcomposition of the invention, and vice versa. Furthermore, compositionsof the invention can be used to achieve methods of the invention.

It will be understood that particular embodiments described herein areshown by way of illustration and not as limitations of the invention.The principal features of this invention can be employed in variousembodiments without departing from the scope of the invention. Thoseskilled in the art will recognize, or be able to ascertain using no morethan routine experimentation, numerous equivalents to the specificprocedures described herein. Such equivalents are considered to bewithin the scope of this invention and are covered by the claims.

All publications and patent applications mentioned in the specificationare indicative of the level of skill of those skilled in the art towhich this invention pertains. All publications and patent applicationsare herein incorporated by reference to the same extent as if eachindividual publication or patent application was specifically andindividually indicated to be incorporated by reference.

The use of the word “a” or “an” when used in conjunction with the term“comprising” in the claims and/or the specification may mean “one,” butit is also consistent with the meaning of “one or more,” “at least one,”and “one or more than one.” The use of the term “or” in the claims isused to mean “and/or” unless explicitly indicated to refer toalternatives only or the alternatives are mutually exclusive, althoughthe disclosure supports a definition that refers to only alternativesand “and/or.” Throughout this application, the term “about” is used toindicate that a value includes the inherent variation of error for thedevice, the method being employed to determine the value, or thevariation that exists among the study subjects.

As used in this specification and claim(s), the words “comprising” (andany form of comprising, such as “comprise” and “comprises”), “having”(and any form of having, such as “have” and “has”), “including” (and anyform of including, such as “includes” and “include”) or “containing”(and any form of containing, such as “contains” and “contain”) areinclusive or open-ended and do not exclude additional, unrecitedelements or method steps.

The term “or combinations thereof” as used herein refers to allpermutations and combinations of the listed items preceding the term.For example, “A, B, C, or combinations thereof” is intended to includeat least one of: A, B, C, AB, AC, BC, or ABC, and if order is importantin a particular context, also BA, CA, CB, CBA, BCA, ACB, BAC, or CAB.Continuing with this example, expressly included are combinations thatcontain repeats of one or more item or term, such as BB, AAA, MB, BBC,AAABCCCC, CBBAAA, CABABB, and so forth. The skilled artisan willunderstand that typically there is no limit on the number of items orterms in any combination, unless otherwise apparent from the context.

All of the compositions and/or methods disclosed and claimed herein canbe made and executed without undue experimentation in light of thepresent disclosure. While the compositions and methods of this inventionhave been described in terms of preferred embodiments, it will beapparent to those of skill in the art that variations may be applied tothe compositions and/or methods and in the steps or in the sequence ofsteps of the method described herein without departing from the concept,spirit and scope of the invention. All such similar substitutes andmodifications apparent to those skilled in the art are deemed to bewithin the spirit, scope and concept of the invention as defined by theappended claims.

1. A recombinant DNA construct comprising a polynucleotide encoding aprotein that has an amino acid sequence having at least 95% identityover at least 95% of the length of a reference sequence selected fromthe group consisting of SEQ ID NO: 116779-233556 when said amino acidsequence is aligned with said reference sequence.
 2. (canceled)
 3. Arecombinant DNA construct comprising a promoter that is functional in aplant cell and that is operably linked to a polynucleotide that: (a)encodes a protein having an amino acid sequence having at least 95%identity over at least 95% of the length of a reference sequenceselected from the group consisting of SEQ ID NO: 116779-233556, whensaid amino acid sequence is aligned to said reference sequence; or (b)is transcribed into an RNA molecule that suppresses the level of anendogenous protein that has an amino acid sequence that is at least 95%identical over at least 95% of the length of a reference sequence of SEQID NO: 116779-233556, when said amino acid sequence is aligned to saidreference sequence; and wherein said construct is stably integrated intoa chromosome in a plant cell nucleus.
 4. A transgenic plant comprisingthe recombinant DNA construct of claim 3 wherein said DNA constructprovides for an enhanced trait as compared to control plants; andwherein said enhanced trait is enhanced water use efficiency, enhancedcold tolerance, increased yield, enhanced nitrogen use efficiency,enhanced seed protein or enhanced seed oil.
 5. The plant of claim 4further comprising DNA expressing a protein that provides tolerance fromexposure to an herbicide comprising an agent applied at levels that arelethal to a wild type of said plant cell nucleus.
 6. The plant of claim5 wherein the agent of said herbicide is a glyphosate, dicamba, orglufosinate compound.
 7. (canceled)
 8. (canceled)
 9. A transgenic seedcomprising the recombinant DNA construct of claim
 3. 10. The transgenicseed of claim 9 from a corn, soybean, cotton, canola, alfalfa, wheat,rice, sugarcane, or sugar beet plant.
 11. A transgenic pollen graincomprising a haploid derivative of a plant cell nucleus having achromosome comprising the recombinant DNA construct of claim
 3. 12. Amethod for manufacturing non-natural, transgenic seed that can be usedto produce a crop of transgenic plants with an enhanced trait resultingfrom expression of a stably-integrated, recombinant DNA construct ofclaim 3, said method comprising: (a) screening a population of plantsfor said enhanced trait and said recombinant DNA, wherein individualplants in said population exhibit said trait at a level less than,essentially the same as or greater than the level that said trait isexhibited in control plants which do not contain said recombinant DNA,wherein said enhanced trait is selected from the group of enhancedtraits consisting of enhanced water use efficiency, enhanced coldtolerance, increased yield, enhanced nitrogen use efficiency, enhancedseed protein and enhanced seed oil; (b) selecting from said populationone or more plants that exhibit said trait at a level greater than thelevel that said trait is exhibited in control plants, and (c) collectingseed from selected plant from step b.
 13. The method of claim 12 whereinsaid method for manufacturing said transgenic seed further comprises:(a) verifying that said recombinant DNA is stably integrated in saidselected plants, and (b) analyzing tissue of said selected plant todetermine the expression or suppression of a protein having the functionof a protein having an amino acid sequence selected from the groupconsisting of one of SEQ ID NOs:116779-233556.
 14. The method of claim13 wherein said seed is corn, soybean, cotton, canola, alfalfa, wheat,rice, sugarcane, or sugar beet seed.
 15. (canceled)
 16. (canceled)